PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopim03g120620.0.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 776aa    MW: 86180.1 Da    PI: 6.3465
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopim03g120620.0.1genomeCSHLView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox64.91.1e-20113167256
                         T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         +k +++t +q++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Sopim03g120620.0.1 113 KKYHRHTVQQIREMEALFKESPHPDEKQRQQLSKQLGLHPRQVKFWFQNRRTQIK 167
                         78899**********************************************9877 PP

2START211.23.8e-662865123206
                         HHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.... CS
               START   3 aeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla.... 77 
                          ++a+++l+k+a+++ep+W ks     e++n+de++++f++ +           +ea+r++g+v+m+l++lv++++d++ qW e+++    
  Sopim03g120620.0.1 286 VNQAMEQLQKMATSGEPLWIKSFetgrEILNYDEYTKEFPPIDKsgdvkskiMGIEASRDTGIVFMELPRLVQTFMDVN-QWREMFPsmis 375
                         5789******************99***************997778999***9***************************.*********** PP

                         EEEEEEEECTT........EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECT CS
               START  78 kaetlevissg........galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksn 159
                         ka+t++vi++g        ga+qlm+ae q+l+p+v  R+++fvRy++q++a +w ivdvSvd  +    ++s+ ++++lpSg+++++ sn
  Sopim03g120620.0.1 376 KAATVDVICNGteganswdGAIQLMFAEVQMLTPVVGtREVYFVRYCKQMSAAQWGIVDVSVDKVEASI-DASLLKCRKLPSGCILQEQSN 465
                         *******************************************************************87.9******************** PP

                         CEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
               START 160 ghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          h+kvtwveh +++++++++l+r++v+sg+a+ga++w+atlq+qce+
  Sopim03g120620.0.1 466 AHCKVTWVEHLECQKNIVDSLYRVTVNSGQAFGARRWMATLQQQCER 512
                         *********************************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.6E-2196163IPR009057Homeodomain-like
SuperFamilySSF466894.6E-2097170IPR009057Homeodomain-like
PROSITE profilePS5007117.378109169IPR001356Homeobox domain
SMARTSM003899.1E-18111173IPR001356Homeobox domain
PfamPF000466.1E-18113167IPR001356Homeobox domain
CDDcd000861.62E-16116167No hitNo description
PROSITE patternPS000270144167IPR017970Homeobox, conserved site
PROSITE profilePS5084835.017275515IPR002913START domain
SuperFamilySSF559613.98E-30277512No hitNo description
CDDcd088752.30E-103279511No hitNo description
Gene3DG3DSA:3.30.530.204.5E-5282508IPR023393START-like domain
SMARTSM002345.5E-59284512IPR002913START domain
PfamPF018521.7E-53286512IPR002913START domain
SuperFamilySSF559615.36E-11555739No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009957Biological Processepidermal cell fate specification
GO:0010062Biological Processnegative regulation of trichoblast fate specification
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 776 aa     Download sequence    Send to blast
MGVVDMSNNP PPHETKDFFP SPALSLSLAG IFRDGVGAGS SAGNMETTEE VEEGSAAGSR  60
GVRPREETST VEISSENSEP MRSRGSDDDL EHDDTCNEDE EDPNNNSKKK KQKKYHRHTV  120
QQIREMEALF KESPHPDEKQ RQQLSKQLGL HPRQVKFWFQ NRRTQIKAIQ ERHENSLLKA  180
EIEKLREENK GLRGNSKNPS CPNCGFASST NNAPTLPAEE QQLRIENARL RAEVEKLRAA  240
LGKYQIGTSP NSSSSCSGGN DEENKSALDF YTGIFGLEKP RIMHIVNQAM EQLQKMATSG  300
EPLWIKSFET GREILNYDEY TKEFPPIDKS GDVKSKIMGI EASRDTGIVF MELPRLVQTF  360
MDVNQWREMF PSMISKAATV DVICNGTEGA NSWDGAIQLM FAEVQMLTPV VGTREVYFVR  420
YCKQMSAAQW GIVDVSVDKV EASIDASLLK CRKLPSGCIL QEQSNAHCKV TWVEHLECQK  480
NIVDSLYRVT VNSGQAFGAR RWMATLQQQC ERLLFFMATN IPTKDTTGVA TLAGRKSILT  540
LAQRMTRGFY RVLGASSYNT WNKIPSKTGQ EDIRVISRRN LTDPGEPQGL ILCAASSIWL  600
PVSRNVLFDF LKDENHRHEW DVMSNGGPVQ SVANLAKGQD KGNAVSIQAV KLRENNMWIL  660
QDTSTNAYES AVVYAPVDIA GMQSVITGCD SSNIAALPSG FSILPDGLES RPFVITSRPE  720
DRSSEGGSLL TVAFQILTSN STTAKLSKES VESINNLLSC TLHKIKTRFQ CDNGY*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755150.0HG975515.1 Solanum lycopersicum chromosome ch03, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004235676.10.0PREDICTED: homeobox-leucine zipper protein GLABRA 2
SwissprotP466070.0HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBLK4BML10.0K4BML1_SOLLC; Uncharacterized protein
STRINGSolyc03g120620.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA62282328
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein